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ABSTRACT 

Post-genomic analysis techniques such as 
next-generation sequencing have produced vast 
amounts of data about micro organisms including 
genetic sequences, their functional annotations 
and gene regulatory interactions. The latter are 
genetic mechanisms that control a cell's character- 
istics, for instance, pathogenicity as well as survival 
and reproduction strategies. CoryneRegNet is the 
reference database and analysis platform for 
corynebacterial gene regulatory networks. In this 
article we introduce the updated version 6.0 of 
CoryneRegNet and describe the updated database 
content which includes, 6352 corynebacterial 
regulatory interactions compared with 4928 inter- 
actions in release 5.0 and 3235 regulations in 
release 4.0, respectively. We also demonstrate how 
we support the community by integrating analysis 
and visualization features for transiently imported 
custom data, such as gene regulatory interactions. 
Furthermore, with release 6.0, we provide easy- 
to-use functions that allow the user to submit data 
for persistent storage with the CoryneRegNet 
database. Thus, it offers important options to 
its users in terms of community demands. 
CoryneRegNet is publicly available at http://www 
.coryneregnet.de. 

INTRODUCTION 

The utilization of high-throughput next-generation 
sequencing techniques has made huge amounts of 
genome data publicly available through numerous life 



science databases and data warehouses. To understand a 
cell's response mechanisms to constantly changing envir- 
onmental influences, it is vital to analyse the data and 
extract both the expression of genes and the encoded 
proteins that play a role in the organism's gene regulation 
[transcription factors, (TFs)]. Putting together these pieces 
in conjunction with literature-derived knowledge on the 
regulation of gene expression has made the genome-wide 
reconstruction of transcriptional regulatory networks 
possible. Unraveling these networks on a large-scale facili- 
tates a detailed understanding of an organism's adapta- 
tion strategies to changing environmental conditions. 
Publicly available reference databases provide 
well-structured access to known regulatory interactions 
as well as integrated analysis features. For prokaryotes, 
three typical examples are RegulonDB (1-8) for 
Escherichia coli, PRODORIC (9-12) mainly for 
Pseudomonas aruginosa and MycoRegNet (13) for 
Mycobacterium tuberculosis. For corynebacteria, 
CoryneRegNet is the reference database and analysis 
platform (14-19). Corybnebacteria are important in bio- 
technological production processes as well as human and 
veterinary medicine; see Refs (20-23) for instance. 
CoryneRegNet is an ontolgy-based data warehouse. It 
consists of a hierarchically multi-layered concept of tran- 
scriptional regulation and utilizes a modular data process- 
ing pipeline integrating several software tools that are 
essential for regulatory network reconstruction such as 
detection of clusters of homologous proteins (24), 
binding site motif matching (25), operon prediction (26) 
and special network visualization and graph analysis 
algorithms. 

In this updated article, we describe further improve- 
ments leading to CoryneRegNet version 6.0. The main 
goal was to implement crucial features fulfilling important 
community needs and adding user-customized regulatory 
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interactions thus further increasing CoryneRegNet's ap- 
plicability to typical biological studies. We describe the 
following updates to CoryneRegNet: 

(1) Updated database content including new regula- 
tions and all fully sequenced and annotated 
corynebacteria. 

(2) Predicted version (CoryneRegNet v6.0p): updated 
inter species network transfer pipeline to fully 
support the new database content. 

(3) New options for visualizing transiently imported 
custom gene expression and gene regulation data 
with GraphVis that was not deposited with the 
CoryneRegNet database yet. 

(4) Users are now allowed to persistently submit their 
own data directly to the CoryneRegNet curation 
team by using new options in the web interface. 

In the following, we briefly depict the updated system 
architecture with a detailed explanation of the above 
listed items. Subsequently, we discuss the improvements 
that CoryneRegNet has gained in its current version 6.0. 
We summarize the updated database content and briefly 
show both, the transient and the persistent data import 
facilities. Finally, we sum up and conclude with how 
CoryneRegNet 6.0 broadens its applicability by extending 
its functionality based on community demands thus 
further increasing its contribution to the corynebacteria 
community. 

MATERIALS AND METHODS 

Database content and technical changes 

Figure 1 shows a timeline of how CoryneRegNet evolved 
over its previous versions and how this shifted the focus of 
analysis interests. 

During CoryneRegNet's development process, many 
different applications for this platform's methodology 
have been tested. For CoryneRegNet version 6.0, as for 
previous releases, we updated the database to include all 
fully sequenced and annotated corynebacteria as well as 
E.coli K-12. Table 1 shows the development of the 



database content from version 1.0 to the current version 
6.0. CoryneRegNet 5.0 and 6.0 are available as an 'experi- 
mental' and a 'predicted' version; the latter incorporating 
the results of the inter-species network transfer pipeline 
that we implemented. CoryneRegNet's strategy for pre- 
dicting high potential gene regulations is based on the 
evolutionary conservation of regulatory sites between a 
target organism and a well-researched model organism, 
see Refs (21,27) for details. In CoryneRegNet version 
6.0, we modified the transfer pipeline to fully support 
the updated database content. We use the C. glutamicum 
regulatory network as the template for transfers of known 
regulations to all corynebacterial target organisms. 

To date, CoryneRegNet 6.0's database content includes 
C. aurimucosum ATCC 700975, C. diphtheriae NCTC 
13129, C. efficiens YS-314, C. glutamicum ATCC 13032, 
C. glutamicum R, C. jeikeium K411, C. kroppenstedtii 
DSM 44385, C. pseudotuberculosis 1002, 
C. pseudotuberculosis C231, C. pseudotuberculosis 
FRC41, C. urealyticum DSM 7109 and Escherichia coli 
K-12 while CoryneRegNet 4.0 included C. diphtheriae 
NCTC 13129, C. efficiens YS-314, C. glutamicum ATCC 
13032, C. glutamicum R, C. jeikeium K411, C. urealyticum 
DSM 7109, E. coli K12, M. tuberculosis CDC1551 and 
M. tuberculosis H37Rv. In release 5.0, we utilized the 
network data from C. glutamicum for inter-species trans- 
fers to C. diphtheriae NCTC 13129, C. efficiens YS-314 
and C. jeikeium K411. With CoryneRegNet 6.0, we now 
provide transferred networks for all included 
corynebacteria. 

Visualizing transient custom data via GraphVis 

One new functionality of CoryneRegNet's web interface is 
the visualization and analysis of custom user-derived 
critical data. This comprises regulatory interactions that 
are not integrated with the database yet (unpublished 
data) as well as gene expression data. Therefore, we 
changed CoryneRegNet's visualization tool GraphVis to 
allow the user to import his own gene expression and gene 
regulation data. GraphVis runs client-sided as a browser 
applet thereby ensuring discrete handling of this user's 



CoryneRegNet vl.0: 

• Corynebacterium Glutamicum 

as model organism 



CoryneRegNet v2.0: 
• Binding site analysis 



CoryneRegNet v3.0: 

• Multiple organisms(four 

corynebacteria, E.coli K-12) 



CoryneRegNet v5.0 

(internal release): 

• Interspecies network transfer 

from C. glutamicum to C. 

diphteria/C. efficiens/C. jeikeium 



Evolving CoryneRegNet vl.0 - v6.0 



CoryneRegNet v4.0: 

* Gene expression data analysis 
(COMA) 

• Homology detection (FORCE) 
-> species network comparison 
and analysis 
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CoryneRegNet v6.0: 

• All fully sequenced and 
annotated corynebacteria 

• Community-based system 

• Interspecies network transfer 
from C. glutamicum to all fully 
sequenced and annotated 
corynebacteria 



Figure 1. Development stages of CoryneRegNet — from 2006 to 2011. 
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Table 1. Development of the database content 



Version 


Organisms 


TFs 


Reg. genes 


Regulations 


BMs 


PWMs 


Stimulons 


Clusters 


Publication 


1.0 


1 


53 


331 


430 


192 


23 






(14) 


2.0 


4 


64 


499 


607 


274 


29 






(18) 


3.0 


5 


213 


1632 


2912 


1522 


130 






(15) 


4.0 


7 


213 


1632 


2912 


1522 


130 


8 


4548 


(16) 


5.0e 


1 1 


245 


1986 


3712 


1759 


144 


11 


5421 




5. Op 


1 1 


350 


2888 


4928 


2553 


249 


11 


5421 




6.0e 


12 


245 


1986 


3712 


1759 


144 


14 


3719 


This 


6.0p 


12 


482 


3946 


6352 


3429 


381 


14 


3719 


This 



Note that the database content of CoryneRegNet 5.0e and 6.0e are equal regarding the number of regulations since we gradually kept the database 
up-to-date, as we will do in the future; TFs, transcription factors; Reg. genes, regulated target genes; BMs, binding motifs; PWMs, position weight 
matrices. 



□ cam 




Figure 2. Custom data visualization process. When visualizing an arbitrary known (database) network with the Graph Vis feature, the user may 
utilize several ways to integrate custom data, gene expression data or, as depicted here, transcriptional regulatory interactions. The latter are 
displayed as dotted lines. This allows for a visual analysis of gene expression data, for instance, together with known (database) and uploaded 
(user) networks prior to publication. 



potentially critical data. As applets run in their own 
runtime environment, memory is allocated locally and 
therefore no information about custom data sets is ever 
transferred over the network. The process of visualizing 
user regulations together with known gene regulatory 
networks is easy and intuitive. It is depicted in Figure 2. 

Persistent submission of user data to the CoryneRegNet 
database 

Being the reference database and analysis platform for 
corynebacterial gene regulatory networks, it is important 
to constantly expand the database content with 
user-derived data. As of yet, we exclusively relied on our 
data curation team. CoryneRegNet version 6.0 now also 



allows its users to directly submit their own data persist- 
ently to the CoryneRegNet database. By utilizing an in- 
tuitive web front-end, integrated in the CoryneRegNet 
platform, users may send their data sets to the database 
curators. We decide whether or not the data meets the 
CoryneRegNet's quality standards and if so, accept the 
new data set. Once accepted the data is directly added to 
the CoryneRegNet database persistently. This allows for a 
well-structured and faster provision of new data without 
the error-prone detour via literature mining. This assures 
CoryneRegNet's applicability to the most recent data sets 
on ongoing matters. With this feature, CoryneRegNet 
eases the information exchange and further improves its 
contribution to the (corynebacteria) community. 
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RESULTS AND DISCUSSION 

With CoryneRegNet version 6.0, we updated the database 
content by adding all recently sequenced corynebacteria 
to the database. As we did in the past, the database will 
be kept up-to-date as more regulations are deciphered in 
the wet lab and more corynebacterial whole-genome an- 
notations become available. To date, CoryneRegNet is the 
largest available knowledge base for corynebacterial gene 
regulatory interactions world wide. Moreover, our 
inter-species network transfer integration ('predicted' 
version), roughly doubles the database content of the 'ex- 
perimental' version of CoryneRegNet. 

In addition to the growing database content, we also 
extended our network tool Graph Vis to allow its users 
to import, visualize and analyse their own gene expression 
and gene regulation data with full functionality. This 
mainly allows the biologists to visualize and analyse 
their own, yet unpublished, findings together with 
known data, such as regulatory interactions, from the 
database. It is ensured that the data are kept transiently 
on the client only during runtime and no information is 
exchanged over the network. Additionally, users can 
submit their own data to the CoryneRegNet database 
for persistent storage. Here, we provide the community 
with a direct, self-explanatory interface and thereby limit 
the potential for typing mistakes. This allows for a faster 
handling of evolving new data in a well-structured way. 

CONCLUSION 

CoryneRegNet 6.0 is a versatile systems biology 
platform to support the efficient and large-scale analysis 
of transcriptional regulation of gene expression in 
corynebacterial microorganisms. With the aforementioned 
changes, we extended the CoryneRegNet platform to be a 
community-based system. In conclusion, CoryneRegNet 
remains the reference database and analysis platform for 
corynebacterial transcriptional gene regulatory networks. 
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